Learning a Multi-View Stereo Machine
نویسندگان
چکیده
We present a learnt system for multi-view stereopsis. In contrast to recent learning based methods for 3D reconstruction, we leverage the underlying 3D geometry of the problem through feature projection and unprojection along viewing rays. By formulating these operations in a differentiable manner, we are able to learn the system end-to-end for the task of metric 3D reconstruction. End-to-end learning allows us to jointly reason about shape priors while conforming to geometric constraints, enabling reconstruction from much fewer images (even a single image) than required by classical approaches as well as completion of unseen surfaces. We thoroughly evaluate our approach on the ShapeNet dataset and demonstrate the benefits over classical approaches and recent learning based methods.
منابع مشابه
Dictionary Learning in Stereo Imaging
This paper presents a new method for learning overcomplete dictionaries adapted to efficient joint representation of stereo images. We first formulate a sparse stereo image model where the multi-view correlation is described by local geometric transforms of dictionary atoms in two stereo views. A maximum-likelihood method for learning stereo dictionaries is then proposed, which includes a multi...
متن کاملPrioritized Multi-View Stereo Depth Map Generation Using Confidence Prediction
In this work, we propose a novel approach to prioritize the depth map computation of multi-view stereo (MVS) to obtain compact 3D point clouds of high quality and completeness at low computational cost. Our prioritization approach operates before the MVS algorithm is executed and consists of two steps. In the first step, we aim to find a good set of matching partners for each view. In the secon...
متن کاملGraph Based Semi-supervised Learning in Computer Vision
OF THE DISSERTATION Graph Based Semi-Supervised Learning in Computer Vision by Ning Huang Dissertation Director: Joseph Wilder Machine learning from previous examples or knowledge is a key element in many image processing and pattern recognition tasks, e.g. clustering, segmentation, stereo matching, optical flow, tracking and object recognition. Acquiring that knowledge frequently requires huma...
متن کاملMachine learning of projected 3D shape
This thesis primarily investigates the potential of the Pairwise Geometric Histogram (PGH) representation as the basis of a machine learning edge and view-based 3D object recognition computer vision system. The work extends 20 years’ worth of associated research within the TINA computer vision research group [1]. PGHs have formerly been engineered as a solution to the presented problem, directl...
متن کاملDemo: A Multimodal Learning Interface for Sketch, Speak and Point Creation of a Schedule Chart
We present a video demonstration of an agent-based test bed application for ongoing research into multi-user, multimodal, computer-assisted meetings. The system tracks a two person scheduling meeting: one person standing at a touch sensitive whiteboard creating a Gantt chart, while another person looks on in view of a calibrated stereo camera. The stereo camera performs real-time, untethered, v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017